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THE MAILING DATE OF THIS COMMUNICATION. 
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DETAILED ACTION 

Specification 

1 . The disclosure is objected to because of the following informalities: 
On page 14, line 10, there is an unmatched left parenthesis. 
Appropriate correction is required. 

2. The disclosure is objected to because it contains an embedded hyperlink and/or 
other form of browser-executable code. Applicants are required to delete the 
embedded hyperlink and/or other form of browser-executable code. See MPEP § 
608.01 . Page 6, Line 1 1 , of the Specification contains an embedded hyperlink. 

Claim Rejections - 35 USC § 102 

3. The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 that 
form the basis for the rejections under this section made in this Office action: 

A person shall be entitled to a patent unless - 

(b) the invention was patented or described in a printed publication in this or a foreign country or in public 
use or on sale in this country, more than one year prior to the date of application for patent in the United 
States. 

4. Claims 1, 7, 8, 14, and 15 are rejected under 35 U.S.C. 102(b) as being 
anticipated by Woodland et al. ("Iterative Unsupervised Adaptation Using Maximum 
Lil<elihood Linear Regression"), 

Regarding independent claims 1,8, and 15, Woodland et aL discloses a method, 
apparatus, and computer program for adaptation in speech recognition, comprising: 
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"providing at least one speech recognition model" - gender independent Hidden 
Markov Models (HMMs) HMM-1 and HMM-2 are built from acoustic training data sets 
consisting of 36,493 sentences (Page 1 134, Left Column, Paragraphs 4 to 7; Page 
1135, Right Column, Paragraph 3); 

"accepting speaker data" - test H3-P0 data was captured for each speaker of 20 
speakers (Page 1134, Right Column, Paragraph 7); 

"generating a word lattice based on the speaker data" - H3 development test 
data is used for lattice generation (Page 1 135, Left Column, Paragraphs 3 to 5: Table 
1); word lattices are used to generate an error rate for H3-P0 data (Page 1 136, Left 
Column, Lines 1 to 6: Table 3); 

"adapting at least one of the speaker data and the at least one speech 
recognition model in a manner to maximize the likelihood of the speaker data with 
respect to the generated word lattice" - language models were trained on the text 
training corpus and the H3 text data sets; HMM-1 models used global MLLR adaptation 
and specific MLLR adaptation from word lattices for H3-P0 data; the result is a 
decreased error rate by adapting HMM-1 ("speech recognition model") to H3 data 
("speaker data") using MLLR (Maximum Likelihood Linear Regression") (Page 1135, 
Right Column, Paragraph 5 to Page 1 136, Right Column, Paragraph 2: Table 3). 

Regarding claims 7 and 14, Woodland et aL discloses maximum likelihood linear 
regression (MLLR) for adaptation of speaker data in speech recognition (Page 1 133). 
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Claim Rejections - 35 USC § 103 

5. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject nnatter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

6. Claims 2 to 6 and 9 to 13 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Woodland et ai in view of Nguyen et aL 

Concerning claims 2 and 9, Woodland et aL discloses generating word lattices, 
but omits generating word lattices by maximum a-posteriori adaptation. However, 
Nguyen et aL teaches adaptation by both Maximum Likelihood Linear Regression 
(MLLR) and Maximum A Posteriori (MAP) adaptation, noting that both techniques are 
available to perform adaptation. It is stated that Bayesian-based MAP techniques are 
particularly useful in dealing with adaptation of sparse data sets, but in practical 
applications, depending upon the amount of adaptation data available, a combination of 
both MLLR and MAP may be used. (Column 1 , Lines 50 to 60) Thus, Nguyen et aL 
performs adaptation with both MLLR and MAP. (Column 3, Line 57 to Column 4, Line 
32) It would have been obvious to one having ordinary skill in the art to generate a 
word lattice with maximum a posteriori adaptation as taught by Nguyen et aL in the 
MLLR adaptation with word lattices of Woodland et aL for the purpose of dealing with 
adaptation of sparse data sets in H3 training data. 
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Concerning claims 3 and 10, Nguyen etaL discloses Bayesian adaptation by 
MAP with Equation 4; y is the observed posterior probability of the observation to adapt 
the speech models ("posterior state occupancy probability"); Mmap is found by summing 
the observed posterior probabilities over time: I y(t) Ot and I Y(t) ("posterior word 
occupancy probabilities by summing over all states interior to a word") (column 4, lines 
10 to 23); the adaptation system then processes the segments in an N-best pass to 
collect the most probable labels; model adaptation may be performed to adapt speech 
models to words ("at least one likely word at each frame") (column 3, lines 1 to 8; 
column 3, lines 47 to 56). 

Concerning claims 4 and 1 1 , Woodland et ai discloses word lattices (Page 1 135, 
Left Column); a word lattice implicitly contains word traces. 

Concerning claims 5 and 12, Woodland et ai discloses pruning during adaptation 
(Paragraph Bridging Pages 1 135 to 1 136), but does not expressly discard 
interpretations associated with low confidence. However, Nguyen et ai teaches 
assigning weights to the N-best transcriptions, so that reliable information becomes 
enhanced by a positive weight, and unreliable information is correspondingly diminished 
by a negative weight. The system thus tends to push models that generate incorrect 
labels away from those that generate correct ones. Subsequently, model information is 
accumulated among the N-best transcriptions for the entire set of sentences and then 
used to adapt the speech models. (Column 3, Lines 32 to 56; Column 4, Lines 23 to 
59) Taking the N-best of the most reliable transcriptions necessarily implies eliminating 
transcriptions not associated with the N-best most reliable transcriptions ("discarding 
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interpretations associated with low confidence"). N-best techniques are well known in 
speech recognition. Nguyen et al. says assigning weights to the N-best transcriptions 
corresponding to their likelihoods produces a natural information and data corrective 
process. (Column 3, Lines 31 to 34) It would have been obvious to one having ordinary 
skill in the art to utilize the N-best technique of Nguyen et ai to discard unreliable 
transcriptions for pruning in the MLLR adaptation with word lattices of Woodland et al. 
for the purpose of producing a natural information corrective process. 

Concerning claims 6 and 13, Nguyen etal. discloses Bayesian adaptation by 
MAP with Equation 4; y is the observed posterior probability of the observation to adapt 
the speech models ("posterior phone probability") (column 4, lines 10 to 23); the 
observations and labels represent phonemes in speech recognition. 

Conclusion 

7. The prior art made of record and not relied upon is considered pertinent to 
Applicants' disclosure. 

Hon et al., Kuroda et al., Lee et al., Netsch, Mammone et al., Kimber et al., 
Digalakis et al., Meunier et al., and Gong disclose related art. 

Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Martin Lerner whose telephone number is (703) 308- 
9064. The examiner can normally be reached on 8:30 AM to 6:00 PM Monday to 
Thursday. 
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If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Richemond Dorvil can be reached on (703) 305-9645. The fax phone 
number for the organization where this application or proceeding is assigned is 703- 



Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). 



872-9306. 
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